Model Selection

Medical Image Analysis

# Medical Image Analysis

Medgemma 4b It Bf16

MedGemma-4B-IT is a vision-language model specialized in the medical field, developed by Google and now converted to MLX format for efficient operation on Apple chips.

Medgemma 4b It Q8 0 GGUF

MedGemma-4B-it-Q8_0-GGUF is a GGUF format model converted from google/medgemma-4b-it, specifically designed for image-to-text tasks in the medical field.

Computer Vision Project

This model is fine-tuned based on the DINOv2 architecture for disease classification of skin lesion images

Image Classification English

Aggregate Segmentation

PyTorch-based DeepLabV3Plus image segmentation model supporting efficient semantic segmentation tasks

Image Segmentation

Matiullah2401592

Gemma 3 4b It Abliterated Q4 0 GGUF

This model is a GGUF format conversion of mlabonne/gemma-3-4b-it-abliterated, combined with the visual component of x-ray_alpha for a smoother multimodal experience.

AKI 4B Phi 3.5 Mini

AKI is a multimodal foundation model that achieves cross-modal mutual attention (MMA) by unlocking the causal attention mechanism in LLMs, addressing vision-language misalignment without additional parameters or training time.

Image-to-Text English

A fine-tuned model based on Vision Transformer (ViT) architecture for classifying chest X-rays, trained on the CheXpert dataset.

Image Classification

Transformers English

An ensemble model for predicting breast cancer and breast density from screening mammograms, using 3 CNN networks with different resolutions for inference

Image Classification

Erax VL 7B V2.0 Preview

EraX-VL-7B-V2.0-Preview is a powerful multimodal model designed for OCR and visual question answering, excelling in processing multiple languages including Vietnamese, with outstanding performance in recognizing medical forms, invoices, and other documents.

Transformers Supports Multiple Languages

GenMedClip is a zero-shot image classification model based on the open_clip library, specializing in medical image analysis.

Image Classification

Fpn Tu Resnet18

A PyTorch-implemented FPN image segmentation model that supports various encoder architectures, suitable for semantic segmentation tasks.

Image Segmentation

smp-test-models

Linknet Tu Resnet18

Linknet is a PyTorch-implemented image segmentation model suitable for semantic segmentation tasks.

Image Segmentation

smp-test-models

Vit Base Brain Mri

An image classification model fine-tuned on the BrainMRI dataset based on Google's ViT base model

Image Classification

Bio Medical MultiModal Llama 3 8B V1

A multimodal biomedical model fine-tuned based on Llama-3-8B-Instruct, supporting text and image processing, suitable for biomedical research and clinical applications.

Florence 2 FT Lung Cancer Detection

A lung cancer detection model fine-tuned based on Florence-2-base-ft, identifying lung cancer types through lung images

Transformers English

This model is used for bone age prediction, built upon the YassinHegazy/xray-model base model.

Image Classification

Virchow is a self-supervised vision Transformer pretrained on 1.5 million whole-slide histopathology images, serving as a slide-level feature extractor for computational pathology downstream tasks.

Image Classification

M3D-CLIP is a CLIP model specifically designed for 3D medical imaging, achieving visual and language alignment through contrastive loss.

Multimodal Alignment

Interpret Cxr Impression Baseline

This model can convert medical images (such as X-rays) into descriptive text to assist in medical diagnosis.

Pneumonia Model

A deep learning model based on ViT architecture for identifying pneumonia symptoms in chest X-ray images

Image Classification

Skin Types Image Detection

A facial image classification model using Vision Transformer (ViT) architecture for detecting dry, normal, and oily skin types

Image Classification

Dinov2 Base Xray 224

The AIMI Foundation Model Suite is a collection of foundation models for the radiology domain developed by the Stanford AIMI team, focusing on medical image analysis tasks.

Image Classification

Dinov2 Base Finetuned SkinDisease

A skin disease classification model fine-tuned based on the DINOv2 base model, achieving 95.57% accuracy on the ISIC 2018+Atlas Dermatology dataset.

Image Classification

Breast Cancer SAM V1

Breast cancer segmentation model based on Segment Anything Model (SAM), used for tumor region identification in medical imaging

Image Segmentation

Transformers Supports Multiple Languages

Segformer For Optic Disc Cup Segmentation

A retinal fundus image segmentation model based on the SegFormer architecture, specifically designed for precise segmentation of the optic disc and cup.

Image Segmentation

SkinSAM is a skin lesion segmentation model based on a 12-layer ViT-b architecture, fine-tuned on ISIC and PH2 datasets, focusing on precise segmentation of skin lesion images.

Image Segmentation

Transformers Supports Multiple Languages

Segformer B0 Finetuned Teeth Segmentation

A dental X-ray image segmentation model fine-tuned based on the MIT-B0 architecture, specifically designed for precise segmentation of tooth regions in dental imaging

Image Segmentation

Pubmed Clip Vit Base Patch32

PubMedCLIP is a version of the CLIP model fine-tuned for the medical field, specifically designed to handle medical images and related text.

Text-to-Image English

flaviagiammarino

Efficientnet ParkinsonsPred

A Parkinson's disease prediction model based on the EfficientNet architecture, achieving approximately 83% accuracy by analyzing patient drawings

Image Classification

Transformers Other

Vit Base Patch16 224 Chest X Ray

This model is a fine-tuned version of Google's ViT-base model on a chest X-ray classification dataset, designed for medical image analysis.

Image Classification

An image classification model fine-tuned based on google/vit-base-patch16-224, suitable for histology image analysis tasks.

Image Classification

ClipMD is a medical image-text matching model developed based on OpenAI's CLIP model, employing a sliding window text encoder specifically designed for medical image classification tasks.

Transformers English

A pneumonia detection model based on the ViT architecture, fine-tuned on a chest X-ray classification dataset with an accuracy rate of 97.68%

Image Classification

Detr Resnet 50 CD45RB 1000 Att

A fine-tuned model based on facebook/detr-resnet-50 for object detection tasks

Object Detection

Vit Mlo 512 Birads

An image classification model based on the Vision Transformer architecture, fine-tuned for BIRADS classification tasks

Image Classification

Resnet 50 Finetuned Brain Tumor

A brain tumor image classification model fine-tuned based on microsoft/resnet-50, achieving an accuracy of 91.71% on the evaluation set

Image Classification

Vit Diabetic Retinopathy Classification

A diabetic retinopathy classification model based on the Vision Transformer (ViT) architecture, achieving 72.87% accuracy on the evaluation set

Image Classification

Swin Tiny Patch4 Window7 224 Finetuned Skin Cancer

A fine-tuned model based on the Swin Transformer architecture, specifically designed for skin cancer image classification tasks

Image Classification

Swin Tiny Patch4 Window7 224 Finetuned Skin Cancer

A fine-tuned model based on Swin Transformer architecture, specifically designed for skin cancer image classification tasks

Image Classification

Vit Base Patch16 224 Finetuned Chest

An image classification model fine-tuned on chest image datasets based on Google's ViT model, achieving 99% accuracy

Image Classification

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase